Picture for Abby D'Cruz

Abby D'Cruz

Evaluating whether AI models would sabotage AI safety research

Add code
Apr 27, 2026
Viaarxiv icon

UK AISI Alignment Evaluation Case-Study

Add code
Apr 01, 2026
Viaarxiv icon